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THE  CONSTRUCTION  AND  FITTING  OF  SOME  SIMPLE  PROBABILISTIC  COMPUTER  MODELS 


Donald  P.  Gaver 
Naval  Postgraduate  School 


1.   Introduction. 


The  purpose  of  this  report  is  to  develop  several  analytical 
(probabilistic)  computer  system  models  in  a  form  that  allows  them  to  be 
"fitted"  to  actual  measurement  data.   Some  of  these  models  have  been 
validated  by  comparison  with  runs  of  actual  sample  programs;  in  general, 
the  agreement  achieved  has  been  reasonably  good,  and  suggestions  are  made 
here  of  ways  in  which  the  models  may  be  improved  so  that  agreement  is  likely 
to  be  better.   Further  measurements  are  being  made,  and  the  success  with  which 
they  can  be  described  by  models  will  be  reported  later. 

The  measurement  data  referred  to  has  been  taken  on  a  Honeywell  6000- 
series  computer  at  the  Joint  Technical  Support  Activity  (JTSA) ,  a  field 
activity  of  the  Defense  Communications  Agency  in  Reston,  Virginia.   The 
measurements  were  made  by  Mr.  Barry  Wallack  of  the  JTSA,  to  whom  I  am  grateful 
for  enthusiastic  and  careful  cooperation. 


2.   Basic  Assumptions. 

The  mathematical  assumption  that  pervades  our  analysis  is  that  our 
systems  act  as  if  they  are  Markovian.   That  is,  at  any  instant  of  time,   t, 
we  can  describe  the  system  state  by  a  random  vector   (N  (t) ,N  (t) , . . . ,N  (t)) 
=  N(t);   furthermore,  knowledge  of  the  value  of  N(t)   allows  us  to  calculate 
all  future  event  probabilities.   Concretely,   N1 (t)   may  denote  the  number  of 
programs  awaiting  and  undergoing  CPU  processing  at   t,   while  N~(t) ,N  (t) ,  . .  . 
represent  the  number  at  each  of  several  different  memory  devices  (e.g.  disk 
units  or  channels  thereto,  tape  units,  etc.).   The  Markovian  assumption  made 
is  equivalent  to  assuming  that  independently  and  exponentially  distributed 
"burst  times"  on  CPU  and  memory  devices  occur.   This  assumption  cannot  be 
rigorously  true,  but  derived  results  are  often  insensitive  to  this  assumption 
(see  formulas  for  the  number  present  in  steady  state  in  the  infinite  server 
queue;  reference  [8]).   Furthermore,  the  usual  measurements  of  computer  system 
performance  only  allow  identification  of  a  single  rate  parameter  for  a  device. 
Experiments  with  realistic  workload  material  will  be  used  to  check  the  validity 
of  this  assumption  as  it  reflects  upon  system  performance  characteristics  of 
interest,  such  as  device  utilization,  and  system  throughput. 

The  reason  for  making  the  Markovian  assumption  is,  in  the  final  analysis 
to  obtain  simple,  easily  understood  expressions  that  involve  a  few  basic 
parameters.   Sensitivity  of  system  performance  to  departures  from  this  type  of 
assumption  can  sometimes  be  effectively  assessed  by  use  of  another  approach, 
that  of  diffusion  approximation  analysis. 


3.   Simple  Configurations. 

Actual  computer  systems,  such  as  the  Honeywell  6000  and  the  IBM  360, 
must  be  represented  as  complex  arrays  of  serving  points  and  queues.   Before 
proceeding  to  complex  models,  however,  I  will  describe  some  much  simpler  ones. 
The  implications  of  these  can  be  easily  compared  to  the  results  of  running 
simple  but  realistic  program  material  through  an  actual  system.   Such  valida- 
tion attempts  are  currently  in  progress  at  JTSA,  conducted  by  Mr .  B.  Wallack. 

Configuration  1:   One  Processor,  One-Channel  to  Disc. 

Suppose  one  processor  (CPU)  services   J   programs  that  occupy  core 
Assume  that  the  reason  that  a  continuous  period  of  CPU  activity  on  a  particular 
program  (a  "CPU  burst"  for  short)  terminates  is  the  requirement  for  information 
contained  on  a  disc.   The  latter  is  accessed  through  a  single  channel.   Because 
of  the  presence  of  other  programs  in  the  system,  there  may  well  be  queueing 
for  the  channel.   After  this  queueing  is  completed,  the  program  accesses  the 
disc,  reads  out  the  information,  and,  at  the  completion  of  the  10  activity, 
assumes  a  place  in  the  CPU  queue.   The  process  continues  until  all  computation 
is  completed  and  the  program  exits,  to  be  instantly  replaced  by  another.   Note 
that  disc  mount  requests  may,  in  the  IBM  system,  delay  the  acceptance  of  new 
jobs  into  core.   This  effect  will  be  modeled  later;  its  effect  is  not  present 
in  the  usual  simple  cyclic  models  for  multiprogramming  systems. 

Several  simplifying  assumptions  are  notable: 

(a)  there  are  a  fixed  number,   J,   of  programs  in  core, 

(b)  computation  and  disc  activity  do  not  overlap,  i.e.  I  do  not  here  admit 
the  possibility  (which  apparently  exists)  that  CPU  and  disc  activity 
can  simultaneously  occur  on  the  same  program, 

(c)  operating  system  requirements  on  the  CPU  (overhead)  are  not  explicitly 
included, 


(d)   Programs  are  homogeneous  enough  to  be  described  statistically  in  a 
simple  fashion. 

We  allow  for  these  assumptions  when  the  model  is  compared  to  actual  computer 

runs  of  simple  program  material. 

Our  configuration  appears  as  below: 


O  0  0[  O  (CPU)  1 X>     O 


Q(DISC) 


\  Channel 


Configuration  1. 


Analysis .   A  Markov  model  of  this  system  requires  the  specification  of  the 

rate  parameters  of  CPU  and  Channel-Disc:   denote  the  expected  length  of  a 

CPU  burst  by  X      ,   and  of  a  Disk  burst  (actual  read  time  from  Disc)  by  u 

Furthermore,  the  system  state  can  be  taken  to  be  N(t),   the  number  of  programs 

at  the  CPU  state — waiting  plus  being  served.   Obviously  J-N(t)   are  then 

present  at  the  Disc  stage.   It  may  then  be  shown  (see  [3],  Gaver  and  Thompson) 

that  (i)  there  is  a  long-run  or  steady-state  probability  distribution  for 

N(t): 

p  =  lim  P{N(t)=n} 
n 


where  P{      }   denotes  a  probability  measure.   Furthermore 

n 
Pn  =  P0^   ; 


0  £■  n  £.  J 


Pn  = 


1  "  ©J+1 


(3.1) 


(3.2) 


Discussion.   Several  points  can  be  made, 


(a)   The  value  of   p    is  the  long-run  fraction  of  time  there  are  exactly   n 
(n=0,  or_  1,  or  2,  ...  or  J)   programs  at  the  CPU  stage.   If   X  >  M   then 
the  jobs  tend  to  be  10  bound.   If  core  size  is  increased,  meaning   J 
is  increased,  then  CPU  idleness  decreases  to  a  positive  lower  bound: 

J  =  1,     CPU  idleness  =  p_  = 


0    A+y 
J  =  °°,      CPU  idleness  =   p=l-~->^ 

U  A 

If   A  <  y,   meaning  the  jobs  tend  to  be  CPU  bound,  then  CPU  idleness 
decreases,  but  now  to  zero: 

J  =  1,      CPU  idleness  =  p   = 


0    A+y 
J  =  °°,     CPU  idleness  =  p  =  0. 

(b)   The  system  productivity  may  be  assessed  as  follows.   Let  program   i 

require   b.   CPU  bursts,  each  of  expected  length  A   .   Then  the  expected 
time  to  finish  k  programs  is 

b    b         b       k 

t  +  t+  •••+ir  =  M  bi  <3-3) 

i=l 

The  expected  amount  of  CPU  time  available  in  a  time  period  T   is  (approxi- 
mately) equal  to  T(l-p  ) .   Thus   T  must  be  such  that 


T(1~V  "  I  I  bi  (3-4) 


k 

I 
i=l 
or 

k 

T 


=  W11   ,    I     b.  (3.5) 

A(l-p»)  .L^      l 


-pcr    i-i 

If  the  number  of  bursts  per  program  is  thought  of  as  random  with  mean  b 
then 


T  ■  k©(I^) 


0' 
=  (Number  of  Programs  to  be  Processed)  *  (Mean  CPU  Time  per  Program) 

*  (CPU  utilization) . 

The  expression  (3.6)  is  increasingly  accurate  as  k  becomes  large  and 
as  program  durations  become  more  homogeneous. 

(c)  Since   p„   depends  only  upon  the  ratio  y-»   an<^  since  the  number  of  CPU 
bursts  nearly  equals  the  number  of  CPU  bursts  for  each  program 

Mean  CPU  Time  per  Program  _  b/X   v  .        . 

Mean  Disc  Time  per  Program   b"/u   X 

The  left-hand  side  of  (3.6)  can  be  estimated  by  running  a  set  of  represent- 
ative programs  through  an  actual  system  and  utilizing  a  monitor.   This 
has  been  carried  out  at  JTSA  on  the  Honeywell  6000.   Since  actual  CPU 
activity  includes  overhead,  and  the  latter  is  not  explicitly  modeled 
(see  Lewis  and  Shedler,  [6],  for  an  explicit  model  in  the  IBM  context) »  the 
rate   X  must  be  reduced  to  represent  overhead. 

(d)  The  expected  number  of  programs  at  the  CPU  stage  is,  in  the  long  run, 
given  by  the  formula 

U-(f)J[i+j(i-^ 

E[N]  =  lim  E[N(t)]  =  Pn  7  < L " — 

t-~       °M         (i-f)2 

Of  course 

E [number  at  Channel]  =  J  -  E [number  at  CPU] 

=  J  -  E[N]. 


Configuration  2:   One  Processor,  Two  Channels  to  Disjount  Groups  of  Discs. 

Again  we  let  one  processor  servp   J   programs  in  a  multiprocessing 
fashion.   However,  after  a  program  burst  terminates  we  assume  that  the  program 
requires  information  from  one  of  a  group  of  Discs,  each  group  being  accessed 
through  its  own  dedicated  (single)  channel.   For  the  moment  we  discuss  only 
the  two-channel  case.   The  configuration  is  as  shown  below. 


c?o 

J*>- 

> 

Under  the  assumptions  (i)  that  each  program  that  leaves  the  CPU 
stage  (burst  terminates)  reports  at  random — with  appropriate  probabilities, 
a    and  a  ,   not  necessarily  equal  to  a   — to  Channel  1  and  Channel  2,  and 
(ii)  that  all  burst  time  distributions  are  exponential,  we  can  analyze  the 
system  by  means  of  the  Gordon-Newell  cyclic  queueing  model,  [  5  ]•   It  turns 
out  that  several  relevant  measures  of  system  performance  can  be  explicitly 
written  down  in  parametric  form  for  the  present  situation;  one  does  not  need 
the  more  comprehensive  methods  of  Buzen  [  1  ] . 
(a)   The  Long-Run  CPU  Idleness  Probability. 


P{CPU  idle}  =  (1-p  )(l-p2) 


J+l    J+l 
P2    "Pl 


J+9         j+2 
P2-P1+(1-P2>P1  "-(l-p1)p2 


(3.8) 


where 


P.-  = 


ai    [Channel  i  (disc)  Burst  Time]  x  [Probability  Channel  i  Selected] 


CPU  Burst  Time 


i  =  1,2,   provided  p   ^  p2 
If  p   =  p  ,   then 


P{CPU  idle|P;L=p2=p}   = 


(l-p)2(J+l)p' 


1  -  pJ+1[l+(J+l)(l-p)] 


(b)   Long-Run  Channel  1  Idleness  Probability. 


J+l, 


P(Channel  1  idle)  =  (1-p  )(l-p   ) 


P2  -  Pl 


,,.    s   J+2  ,.    s  J+2 
P2-P1+(1-P2)P1  -(1-P1)P2 


(3.9) 


(3.10) 


A  corresponding  expression  for  Channel  2  is  obtained  by  simply  inter- 
changing p..   and  p-   in  the  above  formula. 

(c)   Long-Run  Expected  Number  at  Channel,  and  at  CPU,  Stages. 

It  can  be  shown  that  the  expected  number  at  the  channel  stage,  both 
enqueued  and  in  process  of  device  access,  is  given  by  the  expression 


E(Number  at  channels}  = 


(1-P2)(l-Pl) 


.  ,-    v  J+2  ,_    .  J+2 

p2-p1+(i-p2)p1   -(i-p1)p2 


(^(j+Dp^+jp^1 

A    d-p2)2 


-  p 


(l-(J+l)p^+JP^+1 


(1-p^2 


(3.11) 


Of  course 

E[number  at  CPU]  =  J  -  Efnumber  at  Channels]. 

Note  that  the  above  formulas  do  not  give  the  individual  occupancies 
(queues  plus  those  in  service)  at  the  separate  channels,  but  only  the  sum 
of  those  occupancies. 
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(d)   Long-Run  Expected  Number  at  Channel  1. 

It  can  be  shown  that  the  expected  number  of  programs  at  Channel  1 
is  given  by  the  expression 


E [number  at  Channel  1]  = 

;i-(j+i)p^+jPf+1j      T(i-(j+i)(Pl/pjJ+j(Pl/pn)J+1)i 

(3. 


(i-p1)(p2-p1) 


j-s,         ^  J+2  ft         >  J+2 
P2-P1+(1-P2)P1   -(1-P1)P2 


'if        (1-Pl)2     piP2p      d-P1/p2)T" 


The  expected  number  present  at  Channel  2  can  be  obtained  by  reversing   p 
and   p    in  this  formula. 

Note  that  this  expression  and  the  one  given  previously  may  not  describe 
the  (expected)  number  present  if  the  latter  number  is  obtained  from  obser- 
vations made  at  special  instants  of  time.   For  instance,  if  the  Channel 
occupancy  is  counted  just  before  each  new  arrival  to  its  queue  occurs 
one  would  anticipate  an  average  lower  than  that  resulting  from  counts 
made  just  after  each  new  arrival:   think  of  the  case   J  =  1,   for  instance. 
Consequently  the  formulas  in  (c)  and  (d)  may  require  adjustment  in  order 
account  for  sampling  techniques  actually  used.   As  they  stand,  they 
describe  the  expected  channel  occupancy  when  the  channel  is  observed 
either  continuously  through  time,  or  at  uniformly  random  time  points. 


Configuration  3:   One  Processor,   c   Channels  to  Disjoint  Groups  of  Discs 

The  figure  below  depicts  a  very  common  multiprogramming  computer 
configuration  at  least  in  an  approximate  manner. 


A  program  or  job  experiences  a  burst  of  mean  duration  A  at  the  CPU  and 
then  selects  a  Channel  to  a  group  of  devices;  Channel  i  is  selected  with 
probability  a,,  independently  of  ail  previous  events.  Thus  each  Channel 
has  its  own  queue,  and  the  CPU  also  has  a  queue.  The  service  rate  on  Channel 
i  is  u . .  There  are  assumed  to  be  J  jobs  simultaneously  in  the  system, 
and  we  study  the  process  when  it  is  in  the  equilibrium  state. 
Analysis:   The  Gordon-Newell  Cyclic  Queue  Model. 

In   [5]   Gordon  and  Newell  present  a  general  solution  to  the  multi- 
dimensional Markov  process  describing  the  above  setup.   Letting  N.   be  a 
random  variable  denoting  the  number  present  at  Channel   i   in  equilibrium  we 
can  show  that  for  the  present  configuration 


p(n   ,n   , ...,n   )    =  P{N  =n   ,N  =n0 , . . . ,N  =n   } 
Iz  c  112      2  cc 

n1  n  n 

=  K(J)p      p      ...p    C,  n     +  n     +   ...   +  n     ^  J. 

12c  12  c 

where  K(J)   is  determined  by  normalization,  i.e.  by  the  condition  that 


(3.13) 
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y   •••  l  p(n1,n2, . -.,nc)  =  1.  (3.14) 

n1+n_+. . .+n  ^J 
12      c 

We  shall  derive  an  easily  computed  explicit  formula  for  K(J)   in  the  present 

case.   Let 

Xa. 
P±  =  -^   »     i  =  1,2,. ..,c  (3.15) 

i 

the  traffic  intensity  parameter  for  the  single-server  queue  at  Channel   i. 

Recall  that  if  Channel   i  were  confronted  by  a  Poisson  arrival  rate  of   Xa, 

J  x 

then  the  equilibrium  distribution  is  geometric  (if   p.  <  1) ,   and  indeed  if 

J  becomes  very  large,  so  that  the  CPU  is  constantly  busy,  then  it  is  just 

such  an  equilibrium  distribution  that  is  seen.   In  fact,  we  can  write 

n        n  n 

(l-p1)p1i(l-p.)p/...(l-p  )p  C 
P{N  =n.,N  =n.,...,N  =n  N+N-+. . .+N  £J}  = — — — —    (3.16) 

12.  C  C  C  r**"         **"  'v/       -l 

P{Nn+N0+...+N  *J} 
12      c 

where   N.   is  the  state  variable  of  an  unlimited  Poisson  arrival  queue,  and 
note  that 

(1-p  )(1-P2) . . . (1-P  )   ni  n2  nc 

p(n  ,n  ,  .  .  .  ,n  )  =  p   p   p   .  (3.17) 

2       c        «■***  ***      *"   i        2   c 
P{N1+N„+.. .+N  *-J} 
1   2      c 

Thus  all  that  is  necessary  to  find  the  normalizing  constant  is  to  find  the 

distribution  of  the  sum  N.  +  N_  +  . . .  N   =  N.   This  step  is  facilitated  by 

12  c 

taking  generating   functions: 

N.  °°         n.    n.  1-p 


E 


[z   X]    =    (1-p.)       I       z   \x   =  — ^-  ,  (3.18) 


n.=0 

l 


and    thence,    by    the    convolution    theorem 


~  c        1-p . 

Etz    J    °      n      i_n    ^      =   g<z)-  (3-19) 

i=l   L  PiZi 


1  1 


Now  suppose  all  p.   are  numerically  different,  as  will  often  be  true.   It  is 
convenient  to  carry  out  a  partial  fractions  development  as  follows:   rewrite 
g(z)   as 

c    a 


s(z)  "  S  1^7  «-20) 


i-i  --pi2 


and  then  determine  a.  as  follows:  multiply  g(z)  by  1  -  p.z  and  let 
z  -*  p .  .  From  the  product  form  for  g(z)  we  obtain  after  cancelling  the 
term  going  to  zero, 


n   (l-p±) 

— — =  a.  (3.21) 

c  ( -\  x 

n  u;(i-p./p.) 
j-i     J  x 

where  the  right-hand  side  comes  from  cancelling  off  the  denominator   1  -  p.z 

c  ( •  \ 
term  against  the  numerator  of   (l-p.z)g(z);    II      means  the   i —  factor 

1         i=l 
is  omitted  from  the  product.   Thus  we  finally  have  g(z)   completely  deter- 
mined, and  a  term-by-term  geometric  expansion  gives  the  probability  distribution 

of  N: 

c 


Next 


P{N=n}  =  I     a.pn  .  (3.22) 

i=l 


c  n  ,     c  [l-(p  )n    ] 

P{N^n}  =  I     a.  I  p^     =  I     a.  — : (3.23) 

.*•_   i  ,L  n  i    .*•_  i   1  -  p. 

i=l  n  =0  i=l  l 


and  we  can  put  n  -  J   to  obtain  the  denominator  of  the  expression  for 
p(n..,...,n  ),   and  hence  the  normalizing  constant.   The  explicit  expression  is 
thus 
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p(n1,n2, •••»nc)  = 


c  il-p 

I 

i=l 


J+l 


1-P 


11 

y-i 


(i) 


P.-P  . 
i  J. 


-1 


nl  n2 
Pl  P2  ' 


(3.24) 


nn  +  n„  +  .  .  .  +  n  £  J. 
12  c 


Several  formulas  for  useful  operational  measures  can  easily  be  derived, 

e.g.  from  the  observation  that  the  d.f.  of  N  =  Nn  +  .  .  .  +  N    is  the  same  as 
°  1  c 


r+*  r±* 


that   of      N     +    ...    +  N      =  N,      given     N  ^  J.      For   instance,    the   probability    that 
the    CPU   is    idle   is    the   probability    that      N   =   J,      so  all   jobs    are   at    the 
Channel   stage;    now   in   order    to   obtain  a    formula,    "rite 


P{N=J>    =      T      a. p. 
i-1      X   X 


and   then 


(3.25) 


P{N=J}   =  PfN^lN^J}1  = 


1    a-p- 

i=l  x  x 


1-P 


J+l 


Y    J 
L     a-P  • 

i-1  X   X 


i=l  *  ^i 


J+l 


1-  I 


i  1-P. 
i=l      i 


(3.26) 


In  summary 


(a) 


PQ(c,J) 


V   a. p. 
1=1 


J+l 


.  ,   i  1-f 


i=l 


(3.27) 


is  the  long-run  fraction  of  time  the  CPU  is  idle  in  a  c-Channel  system  with 
J   jobs  in  core.   Here 
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ai = 


n  (i-p.) 

i=l 


n 


(i) 


lP±-PjJ_ 


(3.28) 


where   II     represents  the  product  with  the  term  j  =  i  omitted. 

(b)  The  total  Turnaround  Time  for  servicing   k  programs  equals  the  Total  CPU 
Time,  divided  by   1  -  p  (c,J). 

(c)  The  system  under  study  is  closed  (actually,  completed  programs  are  assumed 
to  leave  and  then  to  be  instantly  replaced) ,  and  so  a  long-run  steady  state  is 
reached.   For  such  a  Markovian  system  we  know  that  a  condition  of  detailed 
balance  exists: 

P{CPU  Busy}Ap.  =  P{ Channel   i   Busy}y..  (3.29) 

To  explain  this  intuitively,  suppose  we  run  the  system  for  a  long  time,  _t. 
Then  the  CPU  is  busy  (not  idle)  for  a  time  approximately  equal  to 

t-  P{CPU  Busy}  =  ^[l-p0(c,J)];  (3.30) 

the  latter  is  a  consequence  of  renewal  theory  arguments;  see  Cox  [2].   Now 
at  any  moment  during  a  CPU  busy  time  a  departure  occurs  with  approximate 
probability  Xdt,   and  that  departure  represents  a  program  destined  for 
Channel  i  with  probability  a..   Consequently  the  long-run  expected  number 
of  programs  that  flew  towards  Channel   i   is 

t  P{CPU  Busy}Xa. 

But  the  long-run  expected  number  of  programs  that  flow  from  Channel   i   to  the 
CPU  is  _t  P{ Channel  i  Busy}y.,   for  all  programs  that  leave  Channel   i  go 
back  to  the  CPU.   And  we  must  have  balance: 
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Number  of  Programs  from  CPU  to  Channel  i  = 

Number  of  Programs  from  Channel   i   to  CPU 

in  the  long  run,  else  a  buildup  will  occur  at  one  point  or  the  other.   Thus 

t  P{CPU  Busy}Xa.  =  t  P{Channel   i   Busyly. 
—  1   —  1 

or 

t   P{Channel  i   Busy}   Xa^ 

_t  P{CPU  Busy}    =  ~y7~  =  Pi  (3.31) 

or,  approximately, 

Long-run  Busy  Time  for  Channel   i  „     ai  _  ,  . 

Long-run  Busy  Time  for  CPU     ~  y.   "  pi  U-Jl.aj 

This  approximate  equality  allows  us  to  estimate   p.,   the  input  to  our  evalua- 
tion equation,  from  monitor  data  that  records  total  CPU  and  Channel  times. 
We  shall  put  it  to  use  shortly. 

(d)   The  expected  number  of  programs  at  the  Channel  stage  can  be  calculated 
as  follows  (note  that  N  here  denotes  the  total  of  all  jobs  in  residence  at 
the  Channel  stage  in  the  long  run;  so   J  -  N   is  the  number  at  the  CPU) : 


N 

I      nP{N=n} 

n=0 

J 

I 

n=0 

c 

n      I 
i=l 

n 
a.p. 

1   - 

c 

I      ^ 
i=l     " 

J+1 
pi 
L   1-p. 

c        l-pJ(l+J-p.J) 

/   a.p.   tz T2 

i=l   1  x  (1-Pi)z 

J+1 

c     p 

1  -  I  a.  t±- 

i=l  x  ^i 


IS 


A  similar  formula  is  available  for  the  expected  number  present  at  each  of  the 
i  Channels. 

The  above  formulas  may  easily  be  evaluated  numerically.   Computer 
programs  for  so  doing  have  been  written  in  FORTRAN  and  run  on  the  IBM  360/67 
system  at  the  Naval  Postgraduate  School.   In  addition,  Professor  Neagle  Forrest 
has  programmed  the  two-channel  formulas  for  the  HP-65  hand-held  calculator. 
Extension  of  this  latter  program  to  more  complex  and  realistic  configurations 
would  allow  portable  use  of  the  models  by  sizing  teams. 
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4 .   Fitting  the  Models  to  Measurement  Data. 

Next  I  embark  upon  a  discussion  of  the  manner  in  which  some  actual 
program  material,  constructed  during  the  fall  of  1974  by  Mr.  B.  Wallack  of  JTSA, 
behaved  (a)  when  submitted  to  the  Honeywell  6000,  being  allowed  128K  of  core, 
and  (b)  was  estimated  to  behave  by  the  simple  analytic  (Markov)  models  that 
have  been  developed.   This  discussion  is  concerned  with  the  first  of  our 
attempts  to  validate,  or  at  least  test  the  consistency  and  perhaps  the  useful- 
ness of,  our  simple  models. 
A.   The  Program  Material. 

Program  material  submitted  to  the  H6000  consisted  of  40  programs  that 
required  CPU  processing  alternating  with  disc  file  accesses.   Each  program  was 
so  constructed  that  all  CPU  "burst"  lengths  were  identical,  but  burst  lengths 
varied  between  programs.   The  set  of  programs  was  submitted  as  a  batch  to  the 
H6000  and  the  following  items  were  recorded. 

(1)  Total  Job  (Problem  Program)  CPU  Time. 

(2)  Total  System  CPU  Time. 

(3)  Total  Disc  Time  (Single  Channel  Case). 

(4)  Total  Channel  Time  for  Each  Channel  (Two  Channels  to  Two  Discs  Case). 

(5)  Average  CPU  Queue  (including  item  being  processed) . 

(6)  Average  Channel  A  Queue  (including  item  being  processed;  apparently 
the  queue  was  examined  just  prior  to  each  moment  a  job  joined  the  channel 
queue) . 

(7)  Average  Channel  B  Queue  (same  as  (6)  above). 

(8)  Total  Turnaround  Time  (to  finish  all  40  jobs). 

(9)  Total  CPU  Idle  Time. 

(10)   Average  Number  of  Programs  in  Core  (later,  a  "core  map"  describing  the 
number  of  programs  in  core  at  30  second  intervals  became  available). 


17 


B.   Fitting  the  Models. 

It  is  possible  to  fit  the  models  to  the  data  furnished  by  making  use 
of  the  measured  data  described.   I  will  describe  such  fits,  and  comment  upon 
them. 

First  note  that  formulas  for  CPU  idleness  probability  use  as  basic 
parameter  the  ratio 

1 
—  ..  JL  -  Expected  Disc  or  I/O  burst  time 
u   1^      Expected  CPU  burst  time 

X 

Under  the  assumption  that  CPU  and  I/O  bursts  essentially  alternate  (certainly 
not  always  accurate)  there  are  very  nearly  as  many  CPU  bursts  as  I/O  bursts, 

so 

\   jw  Total  Disc  Time 
y  "Total  CPU  Time   ' 

fo^-  present  purposes  we  have  added  items  (1)  and  (2)  to  obtain  Total  CPU 
Time,  effectively  lowering  the  CPU  processing  rate  to  account  for  overhead. 
A  more  sophisticated  (but  still  simple)  model  has  been  constructed  for  over- 
head; we  do  not  attempt  to  fit  it  here. 

Next  observe  that  our  formulas  for  CPU  idleness  pretend  that  a  fixed 
number  J,   of  programs  is  in  core.   This  is  literally  untrue,  but  we  shall 
set  the  average  of  the  measured  number  in  core  equal  to  J,   later  examining 
the  effect  of  this  step  (better  possibilities  suggest  themselves). 

Finally,  we  will  calculate  CPU  utilization  via  our  formulas,  and  utilize 
the  latter  to  estimate  total  turnaround  time  for  the  40-program  job  stream. 
Other  comparisons  will  also  be  made. 
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C.   Actual  Cases. 

Case  1:   One  CPU,  One  Channel,  One  Device 

Job  CPU  Time  =  0.9830 

System  CPU  Time  =  0.0348 

Disc  Time  =  0.8407 

Average  Number  of  Jobs  in  Core  =3.1 
It  follows  that  we  estimate 


°-8507      =  0.836 


y   0.9830  +  0.0348 

We  may  use  the  basic  formula  (3.2)  and  approximate  relationship  (3. 31, a)  to 
estimate  CPU  idleness  probability.   The  value  obtained  is 

pQ  =  0.181 

Total  (expected)  turnaround  time  is  then  estimated  by  dividing  total  CPU 
time  by  CPU  utilization: 

„   •     ,  „       ,  m        i  m.     0.9830  +  0.0348 

Estimated  Expected  Turnaround  Time  = _   _  

1  —  (J .  lot 

1.0178 


0.819 


=  1.24 


Total  measured  turnaround  time  reported  was  1.41;  the  predicted  result 
was  about  12%  below  the  actual.   This  quality  of  agreement  is  encouraging, 
considering  the  discrepancies  between  the  model  and  the  actual  program  material 
It  is  worthwhile  to  search  for  explanation  of  the  discrepancy,  however. 
Sensitivity  to  Core  Occupancy,   J.   It  is  easily  possible  to  compute  the  effect 
of  various  core  occupancies;  we  do  so  for  only  two  additional  values: 
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d2  =  0.275, 


and 


d.  -  0.136 
4 


1.0178   .  /n 
Expected  turnaround  time,   J  =  2  =  fl  7 „ v   =  1.4U, 

1.0178 


Expected  turnaround  time,   J  =  4  = 


0.864 


=  1.18. 


If  we  graph  expected  turnaround  time  as  a  function  of  J   the  following  type 
of  curve  results: 


Expect. 
Turnaround 


12   3  j 

Consequently  if  we  have  the  probability  distribution  of  J,   and  average  first, 
and  use  the  latter  average  in  d   to  estimate  turnaround  time  we  obtain  a 
result  that  is  smaller  than  that  obtained  by  averaging  turnaround  time,  condi- 
tional on  J.   Thus,  it  may  be  that  careful  attention  to  the  core  occupancy 
stochastic  process  will  deliver  more  accurate  predictions  of  turn  around  time. 

In  order  to  study  this  effect  further  a  "core  map"  at  30  second  intervals 
was  made  by  B.  Wallack.   This  allowed  empirical  development  of  the  distribution 
of  core  content  for  the  data  and  configuration  under  study  here.   We  find 
(letting  p(J)   denote  the  long-run  probability  that  J  programs  are  in  core 
simultaneously)  the  following  numerical  values. 


Number  of  Programs  in  Core   (J) 

1 
2 
3 
4 
5 


Frequency  (estimated  p(J)) 

0.085 
0.235 
0.268 
0.386 
0.026 
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Under  the  apparently  valid  supposition  that  number  of  programs  in  core  (core 
occupancy)  changes  slowly  as  compared  to  "service,"  i.e.  CPU  and  I/O  activities, 
we  can  argue  that  for  fraction  of  time   p(J)   the  output  rate  is  essentially 
X[l-d  ],   so  expected  turnaround  or  throughput  time  requires  the  average  or 
expectation 


—    oo 


"V*Tlrh:  p. 


J=l     "J   J 

where   k  =  number  of  programs,  and   b  =  average  CPU  time  per  program.   The  JTSA 

kb 
experiment  essentially  estimates  —  =  total  CPU  time.   Now  for  the  particular 

data  in  question  we  then  computed   (1-d  )     for  J  =  1,...,5: 

1  1.8360 

2  1.3807 

3  1.2305 
A  1.1566 
5  1.1132 


Weighting  the  above  by  the  estimated   p's,   and  finally  multiplying  by  total 
CPU  time  yields  an  estimated  turnaround  time  of  1.31,  which  is  within  7%  of 
what  was  actually  observed,  although  still  low. 
Comments . 

Variation  in  core  occupancy  may  account  for  the  observed  discrepancy 
between  simple  formula  (constant  occupancy)  turnaround  estimates  and  actual 
measurements.   Required  are  (a)  methods  for  estimating  the  error  of  our  turn- 
around estimate,  since  the  latter  is  based  on  experience  with  finitely  many 
programs,  (b)  a  model  for  predicting  the  core  occupancy  distribution,   {p(J)}, 
when  core  size  is  changed,  (c)  checks  of  results  when  CPU  burst  times  and 
Disc  times  are  not  exponentially  distributed. 
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Case  2:   One  CPU,  Two  Channels  to  Two  Devices,  Programs  Utilize  One 

Device  Exclusively. 
Programs  were  so  written  as  to  make  use  of  just  one  device;  this  demand 
pattern  seems  to  make  the  model's  Markovian-random  demand  structure  less 
applicable.   Results  of  model  fitting  (assuming  constant  J)  are  as  follows. 
Measured  Total  CPU  Time       =  1.3044 
Measured  Total  Channel  A  Time  =  0.4089 
Measured  Total  Channel  B  Time  =  0.4425 


P1  =  0.3135, 


P2  =  0.3392, 


J  =  2.9 


(We  use     to  denote  an  estimate  from  data  available.)   If  these  estimates 
are  used  to  obtain 

Measured  Model  Output 

0.0136 


^0 
Turnaround 

CPU  Queue 


0.072 

1.32 

1.97 


1.41 
2.19 


Although  the  model-predicted  CPU  idleness  differs  from  that  measured  by  a 
factor  of  five,  the  turnaround  times  are  reasonably  close. 


22 


Case  3:   One  CPU,  Two  Channels  to  Three  Devices;  Programs  Utilize  One 

Device  Exclusively. 
The  system  configuration  in  this  case  allows  programs  to  access  two 
disc  devices  by  means  of  one  channel,  and  another  disc  by  means  of  a  second 
channel.   Core  size  was  apparently  increased  in  this  exercise  also,  for  the 
average  core  occupancy  was  slightly  over  six  jobs.   Results  obtained  from 
measurement  and  model  fitting  are  as  follows. 

Measured  Total  CPU  Time  =  1.1429 

Measured  Total  Channel  A  (16)  Time  =  0.6012 
Measured  Total  Channel  B  (8)  Time  =  0.2489 

p   =  0.5260,  P2  =  0.3392,  J  -  6.16 

If  these  estimates  are  used  as  input  we  obtain 

Measured  Model  Output 

Pn(CPU  Idleness)  0.0246  0.0122 

T  (Turnaround)  1.18  1.16 

CPU  Queue  4.86  4.71 

(expected) 

Device  Queue  1.30  1.29 

(expected) 

Although  the  assumptions  of  the  model  are  not  well  satisfied  for  this  particular 
workload,  the  model  and  measurements  generally  agree  well,  although  predicted 
idleness  probability  is  lower  than  that  measured  by  a  factor  of  two. 

Further  exercises  are  being  conducted  and  analyzed,  and  the  results  will 
be  reported  as  they  become  available.   In  general,  there  has  been  reasonable 
agreement  between  model-predicted  results  and  the  measurements  made  on  Honeywell 
equipment  that  have  been  furnished  to  me. 
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5.   Statistical  Considerations  in  Fitting  Computer  Models  to  Data. 

The  previous  section  described  the  quality  of  the  fit  obtained  when 
the  simple  Markovian  computer  model  was  fitted  to  actual  (Honeywell)  workload 
data.   The  purpose  of  this  section  is  to  consider  some  of  the  problems  of 
statistical  estimation  that  arise  when  considering  model  fitting  and  assess- 
ment. 

A.   Inferences  from  Measurements:   Sources  of  Error. 
A-l.   Sampling  Error. 

In  order  to  obtain  measurements  of  system  performance,  strings  of 
k  =  40  programs  were  run  through  each  actual  computer  configuration.   It  is 
a  truism  that  results  obtained  from  a  particular  set  of  40  programs  would  not 
be  exactly  reproduced  if  other  sets  of  similar  jobs  were  run. 

It  is  of  interest  to  use  the  output  data  actually  obtained  from  a  given 
sample  to  set  rough  confidence  limits  on  an  unknown  expected  turnaround  time. 
In  order  to  do  this,  the  following  steps  were  taken 

(i)   Mr.  Wallack  of  JTSA  furnished  me  with  the  clock  times  at  which 
each  of  the  40  jobs  left  the  system.   These  were  naturally  arranged  in 
increasing  order;  differences  between  successive  outputs  were  found  by  subtrac- 
tion. 

(ii)   The  time  differences  between  successive  system  departures  were 
examined  statistically.   In  particular  frequency  plots  were  made  of  the  dis- 
tribution of  the  time  between  successive  departures,  and  moment  estimators 
were  computed. 

(iii)   Estimates  of  mean  inter-departure  time  and  standard  derivation 
of  inter-departure  time  were  in  reasonably  close  numerical  agreement  (standard 
deviation  slightly  smaller  than  mean),  which  indicates  that  program  exits 
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occur  at  nearly  exponentially  distributed  time  intervals.   Such  is  what  one 
anticipate   if  a  "thinning"  mechanism  is  in  effect:   wif-h  small  probability 
8   a  program  exits  the  system  after  a  CPU  burst  (to  be  replaced  by  one 
enqueued)  while  with  probability   1  -  0  it  approaches  a  channel,  later  to 
return  to  the  CPU.   If  exit  events  are  independent,  i.e.  Bernoulli  trials, 
then  under  our  Markov  assumptions  actual  departures  should  be  close  to 
exponentially  distributed.   Assuming  equality  of  mean  and  standard  deviation, 
then,  we  conclude  that  total  turnaround  time,   T,   is  approximately  gamma 
distributed,  on  the  basis  of  which  deduction  confidence  limits  are  obtainable. 

In  fact,  total  observed  turnaround  time  estimates   E[T],   expected  turnaround 

i/2 

time,  and  vk   *  total  observed  turnaround  estimates   [Var[T]]    .   Since   k  =  40 

is  large  enough  to  justify  a  normal  approximation,  we  state  that,  with  approxi- 


mately 95%  confidence,   E[T]   lies  in  the  interval   [(observed  turnaround) 

A 


2 
(1  ±  —  )]  =  [(observed  turnaround)0. 68,  (observed  turnaround) (1. 32) ] 


On  the  basis  of  the  above  error  assessment  it  appears  that  a  sample 
of  40  jobs  is  sufficient  tc  estimate   E[T]   with  an  error  of  about  ±  30%; 
quadrupling  the  batch  size  will  reduce  this  error  to  ±15%,   with  95% 
confidence . 

It  would  seem  to  be  worthwhile  to  keep  the  above  figures  in  mind  when 
models  are  being  assessed — and,  for  that  matter,  when  systems  are  being  sized 
simply  by  running  representative  experimental  programs  and  judgmentally  inter- 
preting the  results.   Although  there  are  alternatives  to  the  error  assessment 
presented,  the  latter  is  simple  and  requires  only  the  total  turnaround  time 
statistic.   I  plan  to  study  more  complex,  and  possibly  more  accurate,  alterna- 
tives in  the  future. 
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Turn  now  to  a  consideration  of  other  possible  causes  of  real  or  apparent 
model  error. 

A-2.   Core  Occupancy  Tail-off  at  End  of  Run. 

Observe  that  if  we  knew  p.,   for   i  =  1,2, ...,c,   and  core  occupancy, 
J,   remained  constant,  and  further  that  our  models'  assumptions  (Markovian- 
exponential)  are  well  satisfied,  then  the  long-run  formulas  will  predict  E[T] 
accurately  provided  the  number  of  programs,   k,   is  "large"  enough  so  that 
transients  at  the  beginning  and  end  of  a  run  are  insignificant.   None  of  the 
abov°  assumptions  are  strictly  true,  and  an  examination  of  core  occupancy  data 
("core  maps")  shows  that  there  is  some  tailing  off  of  occupancy  as  the  end  of 
a  run  is  approached.   Just  how  serious  this  effect  is  upon  our  estimates  is 
under  investigation  at  JTSA.   Such  tail  off  should  tend  to  artificially  prolong 
experimental  runs. 

A-3.   Core  Occupancy:   Slow  Variations. 

The  assumption  that  the  number  of  programs  in  core  is  fixed  and  equal 
to  the  expected  (average)  number  of  jobs  in  core  is  an  oversimplification, 
and  tends  to  produce  a  constant  bias,  as  has  been  remarked.   A  model  that 
addresses  this  problem  is  under  construction. 

A-4.   Program  Material  Statistics. 

If  CPU  and  Channel-Device  burst  times  do  not  resemble  independent 
exponential  random  variables  then  systematic  prediction  errors  are  likely  to 
occur.   In  particular,  if  CPU  bursts  tend  to  be  hyper-exponential  (skewed, 
long-tailed)  appearing,  as  may  be  the  case  when  mixtures  of  programs  are 
present,  then  the  Markcv  model  understates  CPU  idleness  and  turnaround  time; 
see  Gaver  [2],  and  Gaver  and  Shedler  [4].   Before  a  more  refined  model 
can  be  fitted,  however,  further  measurements  must  be  made  and  interpreted. 
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6.   A  Markovian  Model  that  Includes  Overhead. 

The  purpose  of  this  section  is  to  derive  a  simple  model  for  a 
multiprogramming  computer  system  that  explicitly  includes  some  of  the 
possible  effects  of  "overhead"  actions  by  the  CPU.   It  will  be  assumed  that 
at  the  termination  of  every  activity  "burst,"  whether  it  be  CPU  or  I/O,  the 
CPU  enters  an  "overhead"  state,  residing  there  for  a  period  long  enough  to 
carry  out  activity  so  described. 

In  order  to  keep  the  analysis  and  formulas  simple  the  derivations 
will  be  carried  out  for  a  Markovian  system.   Our  objective  is  to  see,  at 
least  qualitatively,  how  overhead  decreases  overall  system  productivity. 
The  present  results,  although  based  on  oversimplified  assumptions,  are  per- 
haps a  degree  more  realistic  than  is  the  simple  ad  hoc  procedure  of  reducing 
CPU  production  rate,   X,   by  an  empirically  derived  factor. 
A.   Derivation. 

Let   N(t)   denote  the  number  of  programs  at  (waiting,  and  being  served 
by)  the  CPU  at  time   t.   Let 

P.(t)  =  P{N(t)  =  j,   and  CPU  doing  problem  program  computing}. 
Q.(t)  =  P{N(t)  =  j,   and  CPU  performing  overhead}. 

Let  v    be  the  expected  duration  of  an  overhead  activity  (any  overhead 
activity,  according  to  our  present  simple  setup).   Furthermore,  assume  that 
at  the  termination  of  any  problem  program  work,  be  it  at  CPU  or  I/O,  the  CPU 
is  immediately  preempted  for  a  random,  exponentially  distributed  time  of  mean 
duration  v 

If   j   programs  are  at  the  CPU,  let  X.      denote  the  CPU  program  burst  com- 
pletion rate,  and  let  u.   be  the  corresponding  rate  for  I/O.   Later  on  we  can 

fill  in  specific  functions  for  X.      and  y.;   at  present  it  seems  unnecessary. 

3        J 

27 


Now  write  down  the  basic  probability  equations  for  P.(t)   and  Q.(t): 

■J  J 

P.(t+dt)  =  P.(t)[l-(X.+y.)dt]  +  vQ.(t).  (6.1) 

the  latter  following  because  if  the  system  is  in  CPU-active  states   j+1  or 
j-1,   and  CPU  or  I/O  completes,  then  the  system  can  only  go  into  CPU-overhead 
state  j,   and  thence  to  active  state  j.   Next 

Q.(t+dt)  =  Q,(t)[l-vdt]  +  P   (t)X   dt  +  P.  ,(t)y.  ,dt;     (6.2) 

the  above  explanation  covers  the  latter  equation. 

Conversion  to  differential  equations,  and  thence  to  balance  equations, 
dP  .       dQ . 
is  routine  (set  — r- *-  =  0,   — p*-  =0   to  get  balance).   The  balance  equation 

results  are : 

WV  =  VV      vqj  *  Ji«vi +  mj-ipj-i  <6-3) 

Therefore,  equating  vq .   terms  gives 

VYWVi  +  Vipi-r  (6-4) 


Start  with 


and  repeatedly  solve 


X0  =  °  "  *-l 


Vo  =  xipi  (6*5) 


yo 


and 


pi  =  po  xx 


P3  =  P0  XX      ...  X  (6'6) 


X.+y.       yn  p-  .. .  y     Xa+^a 
J  1   2      j 
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Finally. 


I     (p.+q.)   =  I-  (6.8) 

j=0   J   J 


B.   Example. 

Suppose   J   jobs  multiprogram  and  make  use  of  a  single  channel  to 
various  devices.   Therefore,   X.  =  X,   y.  =  u,   and 

Pj=Po©J 

from  (6.6)  and  (6.7).   Summing  up  from   j  =  0   to   J   gives,  with  the  aid 
of  (6.8), 

1  -£ 

Po  =  ^k   ST  (6-10) 


i-ffl 


The  long-run  fraction  of  time  during  which  the  CPU  is  concerned  with  overhead 


activity  is 


1=0  j=0 


X+y 
X+y+v 


(6.11) 
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7.   Conflict  on  Channels  Leading  to  Memory  Devices. 

A.   The  Setup:   Two  Channels  to  Several  Equi-Demanded  Discs 

Consider  the  following  configuration,  describing  a  system  with  a 
CPU,  several  memory  devices  (e.g.,  discs),  and  two  channels  by  way  of 
which  the  devices  are  accessed. 


C  PU 


DiSCS 


In  conformity  with  Markov  assumptions  let   A  be  the  CPU  service  rate, 
so   X    is  an  expected  CPU  burst,  and  let   u    be  the  expected  device 
burst. 

It  will  be  assumed  first  here  that  at  the  termination  of  any  CPU 
burst  the  program  is  equally  likely  to  proceed  to  any  device.   There  may 
well  be  an  attempt  to  ensure  this  type  of  behavior  by  shifting  filed 
information  around  to  equalize  device  appeals.   Other  assumptions 
(unequal  probabilities)  are  more  difficult  to  handle,  and  must  wait. 
They  can  be  handled  in  an  analytical  framework,  but  more  equations  must 
be  solved. 

Further  assume  that,  so  long  as  there  is  no  conflict  for  a 
particular  device's  information,  the  rate  of  service  is — where  n 
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now  denotes  the  number  of  programs  in  the  channel-device  stage — 


Un  =<  u   if   n  =  1  (7.1) 


If,  on  the  other  hand,  a  request  comes  from  the  CPU  for  information  from 

the  same  disc  or  one  already  in  use,  and  reaches  the  head  of  the  device 

line,  then 

(  0   if   n  -  0 

%-        .f         ,  (7-2) 

y   if   n  ^  1. 

If  there  are   D  devices  in  all,  the  chance  of  a  conflict  is  assumed  here 
to  be  D   ;   this  assumption  will  be  changed  if  necessary. 

Assume  also  that  service  is  in  arrival  order,  so  that  if  a  channel 
is  connected  to  device   i   in  order  to  service  a  particular  program,  and 
if  a  program  immediately  behind  the  one  being  served  also  requires  the 
disc  in  question,  then  effectively  the  second  channel  is  blocked. 


B-   Probability  Model 

Let  (a)   P  (t)   denote  the  probability  that   n   programs  are 
n 

present  at  the  device  stage  (waiting  and  in  service)  and  both  channels 
are  available  (no  blocking) ,  while  (b)  B  (t)  denotes  the  probability 
of  the  same  event,  save  for  the  difference  that  one  channel  is  blocked. 

Let   A   be  the  CPU  burst  rate   (A    =  expected  CPU  burst  time), 
and   y  '   be  the  expected  time  for  which  a  disc  is  continuously  engaged 
by  a  single  program.   We  let  the  probability  that  a  program  is  blocked 
be   3   (3  =  — ,   D  being  the  number  of  discs),  and   3  =  1-3- 
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B-l  Probability  Equations 


(1)    P0(t+dt)  =  PQ(t)[l-Adt]  +  P1(t)Mdt  +  o(dt),  (7.3) 


for  no  blocking  is  possible  when  n  =  0;   the  first  term  on  the  right 
hand  side  (RHS)  represents  the  probability  that  the  number  of  programs 
at  the  device-wait  stage  is   0,   and  no  new  arrival  occurs  in   (t,t+dt) 
while  the  second  is  the  probability  that   1  was  present,  occupying  a 
channel,  but  terminated  in   (t,t+dt). 


(2)    P1(t+dt)  =  P1(t)[l-(X+y)dt]  +  PQ(t)Adt  +  P2(t)2ydt  +  B2(t)ydt 

+o(dt).  (7.4) 


The  only  term  requiring  remark  is  the  last  on  (RHS):   B~(t)   is  the 
probability  that   2  are  present,  but  only  one  channel  is  in  use  (blocking), 
but  in   (t,t+dt)   the  one  in  service  leaves,  and  either  channel  is  now  open. 


(3)    P2(t+dt)  =  P2(t)[l-(A+2u)dt]  +  P1(t)X6dt  +  P3(t)2y6dt 

+  B3(t)yBdt  +  o(dt).  (7.5) 


The  first  RHS  term  represents  no  change  from  2.  The  second  term 
represents  the  probability  of   1  present  and  a  non-blocked  arrival.   The 
third  term  means  that   3  are  present,   2   of  which  are  in  service;  a 
departure  occurs,  and  the  3 —  is  non-blocking.   The  fourth  term  means 
3  present,  one  channel  blocked  hence  one  in  service,  a  departure  occurs 
in   (t,t+dt)   and  the  next  two  do  not  require  the  same  disc. 

In  general,  for  n  >  2  an  arrival  doesn't  enter  service,  so 
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(4)  Pn(t+dt)  =  Pn(t)[l-(A+2y)dt]  +  Pn_1(t)Xdt  +  Pn+1(t)2p8dt 

+  Bn+1(t)uBdt  +  o(dt)      (7.6) 

Turn  now  to  equations  for  B  (t).   There  can  be  no  blocking  for 
n  =  0   or   1,   so  BQ(t)  =  B..(t)  =  0.   We  omit  writing  o(dt)  . 

(5)  B2(t+dt)  =  B2(t)[l-(X+u)dt]  +  P1(t)A3dt  +  B3(t)u3dt  +  P3(t)2uBdt.    (7.7) 

The  first  term  represents  no  change  (only  one  is  in  service).   The  second 
represents   1   in  service  and  the  arrival  of  a  program  requiring  the  same 
disc.   The  third  represents   3   present,   1   in  service  and  one  channel 
blocked;  the  departure  of  the  latter  finds  that  the  next  in  line  is  blocked. 
The  fourth  means   3  present,   2   in  service;  one  departs,  and  the  third  is 
blocked. 

For  n  ^  3, 

B  (t+dt)  =  B  (t)(l-(A+u)dt]  +  B   ,(t)Adt  +  B  ^(t)uBdt 
n  n  n-1  n+1 

+  Pn+1(t)2y3dt  (7.8) 

To  derive  differential  equations,  subtract   P  (t)   or   B  (t)   from 

n  n 

the  RHS  as  appropriate,  divide  by  dt   and  let   dt  ->-  0.   Balance  equations 
follow  by  setting  derivatives  equal  to  zero. 

(I1)    ApQ  -  upx 

(2')    (A+y)P;L  =  ApQ  +  2yp2  +  Mb2 

(3')    (A+2u)p2  =  A6p1  +  2y6p3  +  y6b3  (7.9) 


n 


(4')    (A+2y)pn  =  Apn_1  +  2y3pn+1  +  y$bn+1 

(4',a)   2yp  =  Ap   -        for  n  =  J 

(5')         (A+p)b2   =   \$Pl  +  y6b3  +  2y3p3 

(6f)         (A+y)b      =   Ab      .    +  y6b    ,,   +  2y3p    .-  ,      J-l  £  n  ^  3 
n  n-1  n+1  n+1 

(6",a)      yba  =   XbJ^1 

Special  Case.   Let  the  number  of  programs  in  the  system  be  J  =  2.   Then 
we  can  solve  (!'),  (2'),  (3'),  and  (5T)  simultaneously: 


Pi  =  77  Pn>  T7  Pn  =  2^P?  +  ^b9  »   and  ^bo  =  x^Pi 


'1   y  F0>   y  ^0 


'2    M"2 


yield 


Pn  = 


x  +  A  +  (i\2£  +  (^ 

y   \y/  2   Vy 


b„  = 


y)3p0'   p2  =2  (v)  *V 


The  probability  of  CPU  idleness  is  b_  +  p_,   so 


Expected  CPU  utilization  = 


y 


y 


2r^  1 


■+f 


(7.10) 


Numerical  Example.   Program  material  synthesized  by  B.  Wallack  of  JTSA 
yielded  the  values 
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=  1.33,   -  =  1.4  *  1.15  =  1.61 


-  =  0.826 


Our  formula  for  J  =  2   shows  that  if   6  = 


D    //  of  disc  units' 


so  if 


Expected  CPU  utilization 


1.83 


1.83  +  0.689 


D+l 


2D 


D  =  2,   Expected  CPU  utilization  =  0.78,   while  if 
D  =  4,   Expected  CPU  utilization  =  0.81,   and  if 
D  =  °°,   Expected  CPU  utilization  =  0.84 


In  order  to  find  CPU  idleness  probability  in  the  general  case, 

we  must  find   p  +  b  .   This  can  be  done  by  solving  equations  (l')-(6') 

J       J 
simultaneously,  subject  to  the  condition  that    J  p,  +   £  b.  =  1.   There 

j=0  J   j=0  J 
are  in  all   (J+l)  -1-  (.T4-l-2)  =  2J   linear  equations  to  be  solved;  any 

available  package  program  for  linear  equation  solutions  should  be 

applicable. 

Another  appealing  possibility  for  obtaining  a  solution  is  to  use 

an  iterative  procedure  of  successive  corrections. 
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C.   Approximations 

The  simplicity  of  the  Markovian  channel  problem  suggests  that  a 
good,  explicit,  approximate  solution  may  be  available.   One  thought  is  to 
examine  the  Disc  stage  at  times  when  blocking  can  occur,  i.e.  when  *:  2 
programs  are  awaiting  Discs  (of  course,  one  may  be  blocked). 

Suppose,  then,  that  the  Disc  stage  is  "flooded, "  i.e.  that  the  CPU 
is  fast  enough  to  keep  the  latter  constantly  working.   Let  P(t)   denote 
the  probability  that  the  channels  are  both  busy  at   t,   and  B(t)   denote 
the  probability  that  one  is  blocked;  clearly  P(t)  +  B(t)  =  1  when 
flooding  occurs.   Now 

P(t+dt)  =  P(t)[l-2ydt]  +  P(t)2u3dt  +  B(t)u3dt  (7.11) 

which  leads  to 

^  =  -2y  P(t)  +  2u3  P(t)  +  yl  B(t) 

=  -2y3  P(t)  +  y3[l-P(t)].  (7.12) 

To  find  the  steady  state  probability 


p  =  lim  P(t) 

set  the  derivative  equal  to  zero  and  solve: 

!_  I 

n  ..   3   _  1-3  _  D  _  D-l 

and 

b  =  lim  B(t)  =1-P=3Tj|-=DTr-  (7.14) 
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Consequently  the  steady  state  output  rate  from  the  "flooded" 
channels  is 


2yp  +  ub   =   2y  (— f)    +  y 


=  2y 


D+P    "  EH-1 
D 


D+l  ' 

and  the  effective  output  rate  per  channel  is 

~      D 

y  =  y  wT 

Now  approximate  the  Disc  stage  service  process  as  follows: 


y1  -y 

y   =  2~      n  i>  2 
n 

and  treat  the  delays  at  the  Disc  stage  queue  as  simple  Markovian,  with 

arrivals   A   as  before  and  the  above  service  rate.   Let  d   denote  the 

n 

steady    state   probability    that      n     programs   are   present   at   Disc   Stage 

(d     ~  p      +  b    ).      If   this   assumption   is  made, 
n  n  n 

\n    X  X„    ...    A      , 

d      .   d        0      1  2 S-l  (7.15) 

n  0  y     y_  yQ    . . .    y 

12  3  n 


so 


di    =    dn  A 
1         0  y 


a„  =  d 


2  °2P 

2y 


Since 
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Since 


1  =  T   d   =  d 


n=0 


n    0 


\      J       \     n_1 
M  n=l  2y 


d„  = 


AiJ 


1  + 


"  1 1  -  A.  ( 

2p 


and  the  probability  that  all  J  programs  are  at  the  Disc  stage  is 


dT   = 


y    2y 


J-l 


"ll  -A- 

2y 


(7.16) 


then     1  -  d     =  CPU  utilization. 

J 


Special   Case.      Let     J  =   2.      Then 


X       X 


a  v  2» 

2    1  +  ^(1  +  -^ 

2y 


and 


1  -  d„  = 


y 


V        y    2y 


1  + 


1  +  A  +  WrD±ii 

y        2  V     L  D   > 
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which  agrees  with  the  answer  found  by  using  the  correct  Markovian  analysis 
for  this  case.   It  thus  seems  reasonable  to  use  the  approximate  result 
d    in  order  to  estimate  CPU  utilization. 

J 
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